Measuring the Higgs boson mass in dileptonic W-boson decays at hadron colliders

It is expected that hadron collider measurements of the Higgs boson mass using the decay
$h \to W^+W^-$, followed by the leptonic decay of each W-boson, will be performed by fitting
the shape of a distribution that is sensitive to the Higgs mass. We demonstrate that the
variable most commonly used to measure the Higgs mass in this channel is not optimal, as it
contains an unnecessary and even counter-productive approximation. We remove that
approximation, without introducing any cost in complexity, and demonstrate that the new
variable is a clear improvement over the old: its performance is never worse, and in some
cases (particularly the high Higgs mass region) it might reduce the fit uncertainty on the
Higgs mass in that channel by a factor approaching two.



Introduction 

The mass of the Higgs boson is the last unknown parameter of the Standard Model. Here, we
present a method to measure it at a hadron collider, assuming the Higgs exists and is
sufficiently massive ($m_h > 130$ GeV) that it decays predominantly to $W$-bosons. The method
is based on the transverse mass observable, $m_T$, that was originally used to measure the
masses of the $W$-bosons themselves, via their decays, $W \to \ell\nu$, to a lepton and a
neutrino. There, since the neutrino is invisible in a detector, one cannot simply reconstruct
the mass of the parent $W$ from the invariant mass of the $\ell\nu$ daughter system; the
transverse mass $m_T$ circumvents this problem. Similarly, in the case of Higgs decays to two
$W$s (one or more of which may be significantly off-mass-shell), if the $W$s subsequently
decay leptonically to $\ell\nu$, we end up with two invisible neutrinos in the final state.
We will describe a generalization of $m_T$ whose distribution features an edge, which will
enable us to extract $m_h$ directly. We believe that the method both complements, and
improves upon, existing strategies for measuring $m_h$ in this channel, and we encourage
experiments to make use of it. The distribution should also aid ongoing Higgs searches at the
Tevatron.

We also briefly discuss potential applications to mass measurement of other particles at the
LHC, for example new resonances (such as Kaluza-Klein gluons from an extra dimension) that
decay to $t\bar{t}$, as well as the lightest stable superpartner (LSP) in supersymmetric
theories.

The original application of the transverse mass was in measurement of $m_W$. We define

$$ m_T^2 \equiv m_v^2 + m_i^2 + 2\,(e_v e_i - \mathbf{p}_v \cdot \mathbf{p}_i), \tag{1} $$

where $\mathbf{p}$ is the momentum transverse to the beam,
$e = \sqrt{\mathbf{p} \cdot \mathbf{p} + m^2}$ denotes the transverse energy, and $v$ and $i$
label the visible and invisible decay products respectively (a charged lepton and a neutrino
in the case at hand).
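As a concrete illustration of definition (1), the following minimal Python sketch evaluates
$m_T$ from transverse momenta. The function name `transverse_mass` and the tuple-based
interface are assumptions made for illustration only; they are not taken from any analysis
code.

```python
import math


def transverse_mass(p_vis, m_vis, p_inv, m_inv=0.0):
    """Transverse mass of Eq. (1).

    p_vis, p_inv: (px, py) transverse momenta of the visible and invisible
    systems in GeV; m_vis, m_inv: their masses in GeV.
    """
    e_vis = math.sqrt(p_vis[0] ** 2 + p_vis[1] ** 2 + m_vis ** 2)
    e_inv = math.sqrt(p_inv[0] ** 2 + p_inv[1] ** 2 + m_inv ** 2)
    dot = p_vis[0] * p_inv[0] + p_vis[1] * p_inv[1]
    mt2 = m_vis ** 2 + m_inv ** 2 + 2.0 * (e_vis * e_inv - dot)
    return math.sqrt(max(mt2, 0.0))  # guard against tiny negative rounding


# A massless lepton and a massless neutrino, back-to-back in the transverse
# plane, each carrying 40 GeV of transverse momentum.
mt = transverse_mass((40.0, 0.0), 0.0, (-40.0, 0.0))
```

For back-to-back massless daughters the result is twice the common transverse momentum; for
collinear massless daughters it vanishes.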

This definition of $m_T$ has two desirable features: first, since the mass of the neutrino is
unknown, but negligible, and the transverse momentum of the neutrino can be inferred from the
missing transverse momentum in the event, $m_T$ is indeed an observable; second, $m_T$ is
always bounded above by the mass $m_W$ of the parent $W$. This is easily shown using the
invariant mass constraint

$$ m_W^2 = m_v^2 + m_i^2 + 2\,(E_v E_i - \mathbf{p}_v \cdot \mathbf{p}_i - q_v q_i), \tag{2} $$

where $q$ is the longitudinal momentum and
$E = \sqrt{q^2 + \mathbf{p} \cdot \mathbf{p} + m^2}$ is the energy, together with the lemma

$$ E_v E_i - q_v q_i \geq e_v e_i, \tag{3} $$

with equality at $E_v q_i = E_i q_v$, which the reader may easily verify. Thus, by computing
the distribution of $m_T$ in many events, $m_W$ appears as the upper endpoint. (In practice,
the finite decay width of the $W$ and other effects lead to $m_W$ appearing as a Jacobian
peak in the data.)
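One way to verify the lemma (3), sketched here rather than taken from the source, is to use
$E^2 = e^2 + q^2$ for each of the two systems and to compare squares:

```latex
\begin{align*}
(E_v E_i - q_v q_i)^2 - e_v^2 e_i^2
  &= (E_v E_i - q_v q_i)^2 - (E_v^2 - q_v^2)(E_i^2 - q_i^2) \\
  &= E_v^2 q_i^2 - 2 E_v E_i q_v q_i + E_i^2 q_v^2 \\
  &= (E_v q_i - E_i q_v)^2 \;\geq\; 0 .
\end{align*}
```

Since $E \geq |q|$ for each system, $E_v E_i - q_v q_i$ is non-negative, so taking the
positive square root gives (3), with equality exactly when $E_v q_i = E_i q_v$. Substituting
(3) into (2) and comparing with (1) then gives $m_W^2 \geq m_T^2$.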

Recently, a number of generalizations of $m_T$ have appeared, with diverse applications for
LHC mass measurements. They include generalizations to: decays with multiple visible
daughters; decays with a massive invisible daughter (such as a DM candidate); and decays of
pair-produced parent particles. The last of these has already been used to measure the mass
of the top quark in the process
$t\bar{t} \to b\bar{b}W^+W^- \to b\bar{b}\ell^+\ell^-\nu\bar{\nu}$ at the Tevatron.



More Invisibles 

There is one other generalization that can be made, which is to situations where a single
decay in itself contains more than one invisible daughter. Practical examples include the
single Higgs decay $h \to WW^{(*)} \to \ell^+\ell^-\nu\bar{\nu}$, the decay of new resonances
(such as a Kaluza-Klein gluon) to $t\bar{t}$, followed by a semi-leptonic decay of each top,
$t \to bW \to b\ell\nu$, or pair decays in supersymmetric theories with both the lightest
superpartner and neutrinos in the final state. To generalize $m_T$ to such a situation, the
obvious thing to do is to replace $m_i$ in (1) by the invariant mass of the invisible system.
Now, in any event, $m_i$ goes unobserved, but it is useful nevertheless to consider its
properties. A first observation is that $m_i$, though a relativistic invariant, now varies
from event to event, taking values on some real, positive interval. The endpoints of this
interval, $m_{i<}$ and $m_{i>}$, are fixed by the particular decay topology. For example, if a
parent of mass $m_0$ undergoes a pointlike three-body decay to one visible particle of mass
$m_v$ and two massless invisible particles, the lower and upper endpoints are given by
$m_{i<} = 0$ and $m_{i>} = m_0 - m_v$ respectively, whereas if the decay involves an
intermediate resonance of mass $m_I$, they are given by $m_{i<} = 0$ and
$m_{i>}^2 = (m_0^2 - m_I^2)(m_I^2 - m_v^2)/m_I^2$.
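The two endpoint formulae can be written as a small helper. This is a sketch under the
assumptions stated in the text (two massless invisible particles, and an on-shell
intermediate resonance in the second case); the function name and its arguments are
hypothetical.

```python
import math


def invisible_mass_endpoints(m_parent, m_vis, m_res=None):
    """Return (lower, upper) endpoints of the invisible-system mass m_i in GeV.

    m_res=None: pointlike three-body decay to one visible particle and two
    massless invisibles. Otherwise the decay proceeds through an on-shell
    intermediate resonance of mass m_res.
    """
    if m_res is None:
        if m_parent <= m_vis:
            raise ValueError("need m_parent > m_vis")
        return 0.0, m_parent - m_vis
    if not m_parent > m_res > m_vis:
        raise ValueError("need m_parent > m_res > m_vis")
    upper_sq = (m_parent ** 2 - m_res ** 2) * (m_res ** 2 - m_vis ** 2) / m_res ** 2
    return 0.0, math.sqrt(upper_sq)


# Illustrative masses only (GeV); they are not values used in the paper.
pointlike = invisible_mass_endpoints(200.0, 10.0)
cascade = invisible_mass_endpoints(200.0, 0.0, m_res=100.0)
```

For given $m_0$ and $m_v$, the resonant upper endpoint never exceeds the pointlike one; the
two coincide when $m_I^2 = m_0 m_v$.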

What is more, it is easy to show that $m_T$ is a monotonically increasing function of $m_i$.
We thus have the chain of inequalities

$$ m_T(m_{i<}) \leq m_T(m_i) \leq m_0. \tag{4} $$

If $m_{i<}$ is known, then $m_T(m_{i<})$ is an observable that is bounded above by $m_0$; if
$m_{i<}$ is unknown, we can determine it using a generalization of the kink method described
in [12, 13, 14, 15].
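The monotonicity can be checked in one line. The following is a sketch of the argument,
holding $\mathbf{p}_v$, $m_v$ and $\mathbf{p}_i$ fixed and differentiating definition (1)
with respect to $m_i^2$:

```latex
\[
\frac{\partial m_T^2}{\partial m_i^2}
  = 1 + 2 e_v \frac{\partial e_i}{\partial m_i^2}
  = 1 + \frac{e_v}{e_i} > 0,
\qquad
e_i = \sqrt{\mathbf{p}_i \cdot \mathbf{p}_i + m_i^2}.
\]
```

The left-hand inequality in (4) then follows from $m_{i<} \leq m_i$, and the right-hand one
from the lemma (3) applied with the parent mass $m_0$ in place of $m_W$.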

Higgs Decays 

For the Higgs decay $h \to WW^{(*)} \to \ell^+\ell^-\nu\bar{\nu}$, it is simple enough to
show that $m_{i<} = 0$, when we ignore the mass of the neutrinos. To wit, consider the
on-shell decay $h \to WW \to \ell^+\ell^-\nu\bar{\nu}$, with $h$ at rest in the laboratory,
in which the two $W$s are emitted back-to-back. Then let the two $W$s decay such that the
neutrinos are emitted parallel to each other (not anti-parallel). In this configuration,
$m_i = 0$. Since $m_i$ is positive semi-definite, $m_{i<} = 0$. Similar arguments apply to
the off-shell decay $h \to WW^{*}$.

That the inequalities in (4) can be made into equalities also follows from the existence of
these kinematic configurations. Thus, by computing

$$ (m_T^{\rm true})^2 \equiv m_T^2(m_i = 0) = m_v^2 + 2\,(e_v |\mathbf{p}_i| - \mathbf{p}_v \cdot \mathbf{p}_i), \tag{5} $$

in many events, we should obtain a distribution in $m_T^{\rm true}$ whose endpoint yields the
mass of the Higgs boson. Since the observable defined in (5) is truly bounded above by $m_0$,
we distinguish it from other transverse-mass-like observables by giving it the label
$m_T^{\rm true}$.

In work to date [1], an alternative transverse mass has been used,

$$ m_T^{\rm approx} \equiv m_T(m_i = m_v). \tag{6} $$

The justification for replacing the unknown $m_i$ by the observable $m_v$ in those papers is
that for Higgs bosons with masses close to $2m_W$ and produced at or near threshold, each $W$
boson will decay almost at rest, and therefore $m_i \approx m_v$. We note though that
$m_T^{\rm approx}$ is not bounded above by $m_0$. Not knowing $m_i$ and without using the
above approximation, the best lower limit we can place on $m_h$ will be with the true
transverse mass (5).

$h \to WW^{(*)}$ simulation

To investigate the relative performance of the alternative transverse mass variables (5) and
(6) we use the HERWIG 6.505 Monte Carlo generator, with LHC beam conditions
($\sqrt{s} = 14$ TeV). Our version of the generator includes the fix to the
$h \to WW^{(*)}$ spin correlations. Our simulations do not include all corrections from
higher orders in $\alpha_s$. These will be important to consider when later comparing against
real experimental distributions.

We generate unweighted events for Standard Model Higgs boson production ($gg \to h$) and for
the dominant background, $q\bar{q} \to WW$.^1 Final state hadrons with $p_T > 0.5$ GeV and
pseudorapidity $|\eta| < 5$ are clustered into jets using the longitudinally invariant $k_T$
clustering algorithm for hadron-hadron collisions [27] in the inclusive mode [28] with
$R = 1.0$. The missing transverse momentum $p_T^{\rm miss}$ is calculated from the vector sum
of the transverse momenta of the neutrinos. No detector simulation is applied in this paper.
Detector effects should provide a relatively small correction since lepton momenta are very
well measured at these energies [1], and the dominant contribution to the missing transverse
momentum $p_T^{\rm miss}$ will be from recoil against well-measured leptons. There will be
some additional smearing of $p_T^{\rm miss}$ from mismeasured and out-of-acceptance hadrons,
but such corrections are small when the hadronic transverse energy in the event is small.

Selection cuts are applied, requiring:

• Exactly two leptons $\ell \in \{e, \mu\}$ with $p_T > 15$ GeV and $|\eta| < 2.5$

• Missing transverse momentum, $p_T^{\rm miss} > 30$ GeV

• 12 GeV $< m_{\ell\ell} <$ 300 GeV

• No jet with $p_T > 20$ GeV

• $Z \to \tau\tau$ rejection: the event was rejected if $|m_{\tau\tau} - m_Z| < 25$ GeV and
$0 < x_i < 1$ for both $i \in \{1, 2\}$^2

^1 Other backgrounds, such as $Z \to 2\tau$, are rendered sub-dominant by the cuts discussed
below [1].

^2 The variable $x_i$ is the momentum fraction of the $i$th tau carried by its daughter
lepton and $m_{\tau\tau}$ is the di-tau invariant mass. They are calculated using the
approximation that each $\tau$ was collinear with its daughter lepton.

FIG. 1: Signal-only distributions of $m_T^{\rm approx}$ (top) and $m_T^{\rm true}$ (bottom)
for various values of $m_h$ (in GeV). No cuts on $\Delta\phi_{\ell\ell}$ and $p_{T,WW}$ have
been applied.



• Relative azimuth $\Delta\phi_{\ell\ell} < \Delta\phi_{\ell\ell}^{\max}$

• Transverse momentum of the $W$ pair system, $p_{T,WW} > p_{T,WW}^{\min}$
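The selection cuts listed above can be expressed as a single predicate. The sketch below is
illustrative only: the event layout (a dictionary with keys such as `leptons`, `met`, `m_ll`,
`jet_pts`, `m_tautau`, `x_tau` and `pt_ww`) is a hypothetical format, the $Z$ mass value is
an assumed input, and the lower bound in $0 < x_i < 1$ is an assumption. The collinear
quantities $m_{\tau\tau}$ and $x_i$ are taken as precomputed.

```python
import math

M_Z = 91.19  # GeV; nominal Z mass, an assumed input


def delta_phi(phi1, phi2):
    """Azimuthal separation folded into [0, pi]."""
    d = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - d if d > math.pi else d


def passes_selection(event, dphi_max, pt_ww_min):
    """Apply the cuts in the order listed; momenta and masses in GeV."""
    # Exactly two leptons (e or mu) satisfying the pT and eta requirements.
    leptons = [l for l in event["leptons"]
               if l["pt"] > 15.0 and abs(l["eta"]) < 2.5]
    if len(leptons) != 2:
        return False
    if event["met"] <= 30.0:                      # missing transverse momentum
        return False
    if not 12.0 < event["m_ll"] < 300.0:          # dilepton invariant mass
        return False
    if any(pt > 20.0 for pt in event["jet_pts"]):  # jet veto
        return False
    # Z -> tau tau rejection using collinear-approximation quantities.
    x1, x2 = event["x_tau"]
    if abs(event["m_tautau"] - M_Z) < 25.0 and 0.0 < x1 < 1.0 and 0.0 < x2 < 1.0:
        return False
    # The two mass-dependent cuts, optimised per Higgs mass hypothesis.
    if delta_phi(leptons[0]["phi"], leptons[1]["phi"]) >= dphi_max:
        return False
    return event["pt_ww"] > pt_ww_min


# A made-up event for illustration.
example = {
    "leptons": [{"pt": 40.0, "eta": 0.5, "phi": 0.3},
                {"pt": 25.0, "eta": -1.0, "phi": 1.5}],
    "met": 45.0, "m_ll": 50.0, "jet_pts": [12.0],
    "m_tautau": 150.0, "x_tau": (0.4, 1.3), "pt_ww": 15.0,
}
accepted = passes_selection(example, dphi_max=1.8, pt_ww_min=10.0)
```

Keeping the two mass-dependent thresholds as arguments reflects the fact that they are tuned
separately for each Higgs mass hypothesis, while the remaining cuts are fixed.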

As has been done in previous studies [1], we optimize the values of the latter two cuts,
$\Delta\phi_{\ell\ell}^{\max}$ and $p_{T,WW}^{\min}$, for each Higgs boson mass. In this case
we select the values which would be predicted to best constrain $m_h$; experimentally one
would select cuts which would give the best expected measurement once an approximate Higgs
boson mass was known. The optimal values ranged from 1.4 to 2.4 for
$\Delta\phi_{\ell\ell}^{\max}$ and up to 10 GeV for $p_{T,WW}^{\min}$.

Both observables correlate with $m_h$ (see Figure 1), so it is possible to make a mass
measurement with either. However $m_T^{\rm approx}$ does not provide a strict event-by-event
lower bound on $m_h$, whereas the kinematic endpoint of the $m_T^{\rm true}$ distribution
shows a clear edge at $m_h$.
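The difference between the two variables can be seen in a deliberately crude toy. The sketch
below is not the HERWIG simulation described in the text: it assumes a Higgs boson at rest,
two on-shell $W$ bosons, isotropic $W \to \ell\nu$ decays with no spin correlations, massless
leptons, and no background or detector effects. All function names are hypothetical.

```python
import math
import random

M_W = 80.4  # GeV; nominal W mass assumed for the toy


def unit_vector(rng):
    """Isotropic random direction."""
    cos_t = rng.uniform(-1.0, 1.0)
    sin_t = math.sqrt(1.0 - cos_t * cos_t)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    return (sin_t * math.cos(phi), sin_t * math.sin(phi), cos_t)


def boost_from_rest_frame(p4, parent4):
    """Boost p4 = (E, px, py, pz) from the parent's rest frame to the frame
    in which the parent has four-momentum parent4."""
    e, px, py, pz = p4
    pe, ppx, ppy, ppz = parent4
    m = math.sqrt(pe * pe - ppx * ppx - ppy * ppy - ppz * ppz)
    dot = px * ppx + py * ppy + pz * ppz
    k = dot / (m * (pe + m)) + e / m
    return ((e * pe + dot) / m, px + k * ppx, py + k * ppy, pz + k * ppz)


def transverse_mass(px, py, m_vis, m_inv):
    """m_T for a Higgs with zero transverse momentum, so that the missing
    transverse momentum is exactly (-px, -py)."""
    pt2 = px * px + py * py
    e_vis = math.sqrt(pt2 + m_vis * m_vis)
    e_inv = math.sqrt(pt2 + m_inv * m_inv)
    # p_v . p_i = -pt2 because the two systems balance in the transverse plane
    return math.sqrt(m_vis ** 2 + m_inv ** 2 + 2.0 * (e_vis * e_inv + pt2))


def toy_event(m_h, rng):
    """Return (m_T^true, m_T^approx) for one toy h -> WW -> l nu l nu decay."""
    p_w = math.sqrt(0.25 * m_h * m_h - M_W * M_W)
    n = unit_vector(rng)
    dilepton = [0.0, 0.0, 0.0, 0.0]
    for sign in (1.0, -1.0):
        w4 = (0.5 * m_h, sign * p_w * n[0], sign * p_w * n[1], sign * p_w * n[2])
        d = unit_vector(rng)
        lep_rest = (0.5 * M_W, 0.5 * M_W * d[0], 0.5 * M_W * d[1], 0.5 * M_W * d[2])
        lep = boost_from_rest_frame(lep_rest, w4)
        for k in range(4):
            dilepton[k] += lep[k]
    e, px, py, pz = dilepton
    m_vis = math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))
    return transverse_mass(px, py, m_vis, 0.0), transverse_mass(px, py, m_vis, m_vis)


def scan(m_h=200.0, n_events=5000, seed=1):
    """Largest value of each variable over a sample of toy events."""
    rng = random.Random(seed)
    events = [toy_event(m_h, rng) for _ in range(n_events)]
    return max(t for t, _ in events), max(a for _, a in events)


max_true, max_approx = scan()
print(f"max m_T^true = {max_true:.1f} GeV, max m_T^approx = {max_approx:.1f} GeV")
```

In this toy the largest $m_T^{\rm true}$ stays at or below the input $m_h$, while
$m_T^{\rm approx}$ overshoots it in a sizeable fraction of events, which is the behaviour the
text describes for $m_h > 2m_W$.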

To examine the relative performance of the two variables, we generate distributions of them
for various choices of $m_h$. This is done for twenty independent pseudo-experiments
(including both signal and the dominant $WW$ background contributions), each corresponding to
integrated luminosity of 10 fb$^{-1}$. Each pseudo-experiment is compared to (signal and $WW$
background) model distributions with differing hypotheses of $m_h^{\rm model}$.

An example pseudo-experiment distribution for $m_T^{\rm true}$ is shown in Figure 2. For each
pseudo-experiment the binned log likelihood of the data is calculated. Each likelihood is
maximised over the normalisations of the model $h \to WW$ signal and $WW$ background
distributions, reflecting our uncertainty in the cross-sections and luminosity:^3

$$ \log \mathcal{L}(m_h, m_h^{\rm model}) = \left\langle \max_{f_{\rm SIG},\, f_{\rm BG}} \sum_i \log \mathcal{L}_P(n_i^{\rm trial}; x_i) \right\rangle_{\rm trials} $$

FIG. 2: Example pseudo-experiment $m_T^{\rm true}$ distribution (points with error bars) and
model distribution (shaded histograms) for integrated luminosity of 10 fb$^{-1}$. The plot
includes the $WW$ background and is made for $m_h = m_h^{\rm model} = 180$ GeV,
$\Delta\phi_{\ell\ell}^{\max} = 1.8$ and $p_{T,WW}^{\min} = 10$ GeV.

FIG. 3: Relative log likelihood distributions for various Higgs boson masses for each of
several different input masses and for both $m_T^{\rm approx}$ (dashed) and $m_T^{\rm true}$
(solid). The points correspond to integrated luminosity of 10 fb$^{-1}$, and are plotted for
$m_h \in \{130, 140, 150, 160, 170, 180, 200, 220, 240\}$ GeV.

^3 While uncertainties in the shapes of those distributions are also likely to be important,
they are difficult to estimate without collision data and so are not considered in this
paper. We note that shape effects are likely to be more detrimental for $m_T^{\rm approx}$;
the position of the kinematic edge in $m_T^{\rm true}$ should be robust against uncertainties
in smoothly-varying background parameters.

Other Applications

There are many other possible decay processes at the LHC involving multiple invisible
daughters, to which similar methods might be applied. One is to decays of new resonances,
such as a Kaluza-Klein gluon from an extra dimension, in the $t\bar{t}$ channel, followed by
semi-leptonic decays of the tops. For heavy resonances (existing constraints suggest that a
KK gluon should be multi-TeV, for example), the approximation $m_i \approx m_v$ will
certainly be inappropriate.

A second example is supersymmetric decays involving both the LSP and neutrinos. There, we do
not know the mass of the LSP and we are forced to resort to a kink-based method, as in
[12, 13, 14, 15].



FIG. 4: The bands show the fractional uncertainty with which one could expect to measure
$m_h$ with $m_T^{\rm approx}$ (black) and $m_T^{\rm true}$ (shaded) as a function of $m_h$.
The integrated luminosity simulated is 10 fb$^{-1}$.



In the expression for $\log \mathcal{L}$, the sum is over histogram bins,
$\mathcal{L}_P(n; x)$ is the Poisson likelihood and
$x_i(f_{\rm SIG}, f_{\rm BG}, m_h^{\rm model})$ is the expected number of events if signal and
background cross sections are $f$ times their leading-order Monte Carlo predictions. The
angle brackets indicate an average over the twenty pseudo-experiments.
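The likelihood calculation just described can be sketched as follows. This is a minimal
illustration, not the fitting code used for the paper: the brute-force grid scan over the two
normalisations stands in for whatever maximiser was actually used, the function names are
hypothetical, and the template numbers are made up. It assumes every background bin is
non-empty so that the expected count is always positive.

```python
import math


def poisson_log_likelihood(observed, expected):
    """Sum over histogram bins of log Poisson(n_i; x_i)."""
    total = 0.0
    for n, x in zip(observed, expected):
        total += n * math.log(x) - x - math.lgamma(n + 1.0)
    return total


def profile_normalisations(observed, signal, background, f_max=3.0, steps=60):
    """Maximise the binned log likelihood over f_SIG and f_BG on a grid.

    Returns (max log likelihood, best f_SIG, best f_BG).
    """
    best = (-math.inf, None, None)
    for i in range(steps + 1):
        f_sig = f_max * i / steps
        for j in range(1, steps + 1):  # f_BG > 0 keeps expected counts positive
            f_bg = f_max * j / steps
            expected = [f_sig * s + f_bg * b for s, b in zip(signal, background)]
            ll = poisson_log_likelihood(observed, expected)
            if ll > best[0]:
                best = (ll, f_sig, f_bg)
    return best


def averaged_log_likelihood(trials, signal, background):
    """Average of the profiled log likelihood over pseudo-experiments."""
    values = [profile_normalisations(t, signal, background)[0] for t in trials]
    return sum(values) / len(values)


# Made-up four-bin templates for one mass hypothesis, and pseudo-data equal
# to the sum of the two templates.
signal = [2.0, 10.0, 30.0, 8.0]
background = [40.0, 35.0, 30.0, 25.0]
observed = [42, 45, 60, 33]
best_ll, best_f_sig, best_f_bg = profile_normalisations(observed, signal, background)
```

Repeating `averaged_log_likelihood` with templates built for each value of
$m_h^{\rm model}$ would give one point per hypothesis on a likelihood curve.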

The resulting curves of $-2\Delta\log\mathcal{L}$ are plotted in Figure 3, where $\Delta$
indicates the difference from the minimum value. The relative precision with which each
method can be expected to measure the Higgs boson mass is determined from a quadratic fit to
$-2\Delta\log\mathcal{L}$ around the minimum. The fractional uncertainties (Figure 4) show
that the true transverse mass performs somewhat better than the approximate version for all
$m_h$, so there appears to be no advantage in making the approximation $m_i \approx m_v$.
When $m_h > 2m_W$ there is a significant penalty to pay for assuming $m_i \approx m_v$: the
true transverse mass provides the higher-precision measurement.
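A quadratic fit of this kind can be sketched with a three-point parabola through the lowest
scan point and its two neighbours. The text says only that a quadratic fit was used, so the
three-point form is an assumption, as is the convention that the uncertainty is the
half-width at which $-2\Delta\log\mathcal{L}$ rises by one unit. The function name is
hypothetical.

```python
import math


def parabola_minimum(masses, minus_two_delta_log_l):
    """Fit a parabola through the lowest scan point and its neighbours.

    Returns (best_mass, sigma), where sigma is the half-width at which the
    fitted parabola has risen by one unit above its minimum. The mass points
    need not be evenly spaced.
    """
    k = min(range(len(masses)), key=lambda j: minus_two_delta_log_l[j])
    if k == 0 or k == len(masses) - 1:
        raise ValueError("minimum lies at the edge of the scan")
    x0, x1, x2 = masses[k - 1], masses[k], masses[k + 1]
    y0, y1, y2 = (minus_two_delta_log_l[j] for j in (k - 1, k, k + 1))
    slope01 = (y1 - y0) / (x1 - x0)
    slope12 = (y2 - y1) / (x2 - x1)
    a = (slope12 - slope01) / (x2 - x0)  # coefficient of the quadratic term
    b = slope01 - a * (x0 + x1)          # coefficient of the linear term
    if a <= 0.0:
        raise ValueError("scan is not convex around its minimum")
    return -b / (2.0 * a), 1.0 / math.sqrt(a)


# Made-up scan: an exact parabola centred on 171 GeV with width 2.5 GeV.
scan_masses = [160.0, 165.0, 170.0, 175.0, 180.0]
scan_curve = [((m - 171.0) / 2.5) ** 2 for m in scan_masses]
best_mass, sigma = parabola_minimum(scan_masses, scan_curve)
fractional_uncertainty = sigma / best_mass
```

The ratio `sigma / best_mass` corresponds to the kind of fractional uncertainty plotted in
Figure 4.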

The absolute uncertainties (for both variables) will obviously be somewhat broadened when
experimental resolution and sub-leading backgrounds are included. While such detailed
simulations are beyond the scope of this paper, we project that the desirable properties of
$m_T^{\rm true}$ will mean it is also the more appropriate variable in the real world.

One might also expect $m_T^{\rm true}$ to be a good selection variable for Higgs boson
discovery and for measuring the product of cross-section and branching ratio for Higgs
production and di-leptonic decay, by counting the number of signal events. Indeed, as
discussed in the Appendix, we find that $m_T^{\rm true}$ again gives an improvement, albeit a
slight one, over $m_T^{\rm approx}$ in both cases.



Conclusions 

There seems to be no advantage in using the approximate version of the transverse mass,
whether for Higgs boson discovery, for mass determination or for measuring event rates.
Indeed our simulations show that the approximation is often counter-productive, particularly
if the objective is to make a Higgs boson mass measurement and especially when
$m_h > 2m_W$. The true transverse mass is easy to calculate, and (unlike the approximate
version) provides an event-by-event lower bound on $m_h$.
These results should be cross-checked with more detailed studies with: full detector
simulation; more sophisticated models for the signal and background distribution shape
uncertainties; and with calculations to higher orders in $\alpha_s$.^4 Future work should
also consider the case of Higgs boson production via vector boson fusion, for which one might
expect rather similar results.

Other examples of processes where this generalization of $m_T$ could be used include
Kaluza-Klein gluon decays $g_{KK} \to t\bar{t}$, where the top quarks decay via leptonic $W$
bosons, and supersymmetric decays involving neutrinos, such as
$\tilde{\chi}^{\pm} \to \ell^{\pm}\nu\tilde{\chi}_1^0$.

Although we have focussed our attention here on the decay $h \to WW$, it is worth remarking
that, in the case of a Standard Model Higgs boson with $m_h > 2m_Z$, the decay channel
$h \to ZZ \to 4\ell$ will allow the Higgs mass to be measured at the per mille level.
Nevertheless, the decay $h \to WW$ would provide an important corroborative measurement.

We are grateful to Bryan Webber for making us aware 



^4 A subsequent study with full detector simulation [30] confirms our results and, in
addition, suggests that the true transverse mass appears to have the advantage over
$m_T^{\rm approx}$ of being less correlated with $\Delta\phi_{\ell\ell}$.

Appendix: Higgs discovery and branching ratio

To quantify the Higgs boson discovery potential using $m_T^{\rm true}$ as a selection
variable, we calculate the log likelihood difference

$$ -2\Delta\log\mathcal{L} = -2 \left\langle \log\mathcal{L} - \log\mathcal{L}_0 \right\rangle_{\rm trials}, $$

where $\mathcal{L}_0$ is the likelihood of the trial data when the model contains no Higgs
boson contribution. $\mathcal{L}$ is maximised over $f_{\rm SIG}$, $f_{\rm BG}$ and
$m_h^{\rm model}$; $\mathcal{L}_0$ is maximised over $f_{\rm BG}$.

Plots of this $-2\Delta\log\mathcal{L}$ are shown as a function of $m_h$ in Figure 5. The
absolute numbers are optimistic, since the discovery potential will be reduced by subdominant
backgrounds and detector resolution, but the relative performance of the two variables is
meaningful. One can see that the plots are rather similar, but there may be a small advantage
in using $m_T^{\rm true}$ rather than $m_T^{\rm approx}$ when $m_h > 2m_W$.

The branching ratio of the Higgs boson to $W$ boson pairs is another parameter of significant
interest. The relative precision with which one could measure the number of signal events,
which is proportional to $\sigma(pp \to h) \times BR(h \to WW^{(*)})$, was determined from
quadratic fits to

$$ \left\langle \max \sum_i \log \mathcal{L}_P(n_i^{\rm trial}; x_i) \right\rangle_{\rm trials} $$

for various $m_h$. The resulting fractional uncertainty bands are shown in Figure 6. There is
again an advantage in using $m_T^{\rm true}$, though in this case it is slight.

FIG. 5: Higgs boson discovery potential as a function of $m_h$. The center of each band
indicates the difference in log likelihood between models with and without a Higgs boson
contribution. Lower values correspond to better discovery potential. The half-width of each
band gives the root-mean-squared over trial samples. The integrated luminosity simulated is
10 fb$^{-1}$.

FIG. 6: Precision with which one could expect to measure the Higgs boson cross-section times
branching ratio to $WW^{(*)}$ as a function of $m_h$. The integrated luminosity simulated is
10 fb$^{-1}$.